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Abstract 

Quantum entanglement distillation protocols are LOCC protocols between Alice and Bob that convert 
imperfect EPR pairs, or, in general, partially entangled bipartite states into perfect or near-perfect EPR pairs. 
The classical communication complexity of these protocols is the minimal amount of classical communication 
needed for the conversion. In this paper, we focus on the communication complexity of protocols that operate 
with incomplete information, i.e., where the inputs are mixed states and/or prepared adversarially. 

We study 3 models of imperfect EPR pairs. In the measure-r model, r out of n EPR pairs are measured 
by an adversary; in the depolarization model, Bob's share of qubits underwent a depolarization channel; in 
the fidelity model, the only information Alice and Bob possess is the fidelity of the shared state. 

For the measure-r model and the depolarization model, we prove tight and almost-tight bounds on the 
outcome of LOCC protocols that don't use communication. For the fidelity model, we prove a lower bound on 



the communication complexity that matches the upper bound given by Ambainis, Smith, and Yang |ASY02 



1 Introduction 

1.1 Entanglement Distillation Protocols 

Quantum entanglement plays a central role in quantum information theory. The phenomenon of having entangled 
states separated by space, is one of the quintessential features in quantum mechanics. In fact, one of the most 
fundamental problems in quantum information theory is to understand entanglement. In particular, a very 
important question is how to quantify entanglement: how do we measure the amount the entanglement of a 
general bipartite state? 

Not only is q uantum entanglement conceptually interesting, it is very useful "in practice". If Alice and Bob 



share EPR pairs [EPR35|, then they can perform teleportation [ BBC+93[ : Alice can transmit an unknown qubit 
to Bob by simply sending 2 classical bits. In this sense, shared EPR pairs (paired with a classical commu nication 
channel) are equivalent to a quantum channel. Furthermore, EPR pairs make "superdense coding" [BW92| 
possible, where Alice can transmit 2 classical bits to Bob by only sending one qubit, provided that Alice and 
Bob share an EPR pair a priori. However, qubits are prone to errors, and EPR pairs may decohere and become 
imperfect. Can Alice and Bob perform reliable teleportation and superdense coding if they share imperfect EPR 
pairs? 

Entanglement Distillation Protocols (EDPs) provide answers to both questions mentioned above. Informally, 
EDPs are two-party protocols that take imperfect EPR pairs (or general entangled states) as input, and output 
bipartite states that are near-perfect EPR pairs. During the execution of the protocol, both parties (denoted by 
Alice and Bob) can perform local quantum operations (unitary transformations and measurements) on their share 
of qubits, and communicate classical information. Alice and Bob are not allowed to send qubits to each other. 
Protocols of this type are called "LOCC protocols" , standing for "Local Operation Classical Communication" . 
With EDPs, one can derive a quantity, namely the "distillable entanglement", for any bipartite state. The 
distillable entanglement of a state is the maximum number of EPR pairs Alice and Bob can output using the 
optimal EDP, which proved to be a very important quantity in measuring the amount of entanglement for bipartite 
states. This answers the first question we mentioned above. For the second question, Alice and Bob can engage in 
an EDP to "distill" near perfect EPR pairs from imperfect ones, and then use the distiled EPR pairs to perform 
teleportation and superdense coding reliably. 
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There have been numerous research efforts on entanglement distillation protocols. We only list some of the 
most relevant work here. 

To our knowledge, Bennett, Bernstein, Popescu, and Schumacher are the first t o consider the problem of 
producing EPR pairs from "less entangled" states. In their seminal paper ||BBP+96a |, they gave a protocol that 
converts many identical copies of pure state | 0) = (cos 6*1 01) + sin^?| 10)) to perfect EPR pairs. They call this 
process "e ntanglemen t concentration". In the same year, Bennett, Brassard, Popescu, Schumacher, Smolin, and 



i2p 

an arbit rary pu re state, see Vidal |V99| , 

Nielsen [VJNO0|. Much of the work was built on the result of majorization by Nielsen N99a 



Jonathan and Plenio JP99 



Wootters [ BBP+96b | studied the problem of "extracting" near-perfect EPR pairs from identical copies of mixed 
entangled states. This is the first time that the notion "entanglement purification protocols" was presented, which 
were renamed to "entanglement distillation protocols" later They also pointed out that EDPs can be used to 
send quantum information through a noisy channel. L ater, Bennett, DiVincenzo, Smolin and Wootters |BDS+96| 
improved the efficiency of the protocols in | BBP+96b | and proved a result that closely related EDPs to quantum 
error correcting codes, which is an alternative means to transmit quantum information reliably through a noisy 
channel. 

Horodecki, Horodecki, and Horodecki |HIIII9£, HIIII9(: | and Rains | R98a , R98b , R00| gave various asymptotic 
bounds on distillable entanglement for arbitrary entangled states. They considered the situation where n identical 
copies of a state are given as input to an LOCC protocol, which then outputs m EPR pairs. They studied the 
asympototic behavior of m/n as n app roaches infinity. Researchers also studied EDPs for a single copy of 

Hardy |H99|, an d Vida l, Jonathan, and 

who is the first 

one that studied conditions under which one pure state can be transformed into another one by LOCC. All the 
work above assumes that Alice and Bob know the explicit description of the state they share, and so they can 
act optimally. 

From another direction, researchers have studied EDPs with incomplete information, where Alice and Bob 
don't know the exact state they share. The state is in a mixed state, or is prepared adversarially. In this case 
we cannot hope that Alice and Bob would act optimally. However, there still exist protocols that do reasonably 
well. Bennett et. al [BBP+96b, BDS+96| studied the model where Bob's share in the EPR pairs underwent a 
noisy channel, resulting in a mixed state. They showed that their protocol would "distill" near-perfect EPR pairs 
even when Alice and Bob don't have the complete knowledge of the shared state. Under anot her cir cumstance, 
"purity-testing protocols" were studied implicitly by Lo and C hau ||LC99 |, Shor and Preskill [3P0C|, and later 
explicitly by Barnum, Crepeau, Gottesman, Smith, and Tapp BCG+0^ . Purity-testing protocols are LOCC 
protocols t hat app roximately distinguish the state of perfect EPR pairs from the rest states. Ambainis, Smith, 
and Yang [ASY02] pointed out that purity-testing protocols are indeed EDPs where Alice and Bob only know 
the fidelity of the state they share. Using constructions from |BCG+02|, Ambainis, Smith and Yang constructed 
a "Random Hash" protocol that produces {n — s) EPR pairs of conditional fidelity at least 1 — 2^"/ (1 — e) on any 
n qubit-pair input state of fidelity 1 — e. Their protocol would fail with probability e, and the conditional fidelity 
of its output is the fidelity conditioned on that the protocol doesn't fail. 



1.2 Communication Complexity 

Classical communication complexity studies the minimal number of (classical) bits needed to be tr ansmitted 
between multiple parties in order to collectively perform certain computation. P ioneere d by Yao | Y79 |, it is now 
a very rich field in theoretical computer science, and the readers are refereed to [ KN97 | for more information. 

Quantum communication complexity, on the other hand, mostly studies the minimal number of qub its n eeded 
to be exchanged in order to perform (quantum) computation. This field is also first studied by Yao | Y93 ] , and 
now it is becoming one of the central topics in quant um information theory. Numerous results have emerged, and 
we refer the readers to | B01 | for a nice survey, and | BBC+98 , KOla , KOlb R02| for some important techniques 
and results. 

However, another class of problem, namely the classical communication complexity for quantum protocols, 
has being largely ignored, until very recently. This class of problem is concerned with the minimal number of 
classical bits needed to be communicated to perform certain quantum computation. An example is the classical 
communication complexity for EDPs: one may ask "how many bits do Alice and Bob need to exchange in order 
to distill n EPR pairs?" One reason that not many researchers pay too much attention to this problem might be 
the conception that classical communication is " cheap" compared to quantum communication, and thus can be 
ignored. However, as pointed by Lo and Popescu | LP9£ ], there are situations where cl assical communication isn't 
"that" cheap that can be justifiably ignored. One example is the super-dense coding [ BW92 |: Alice and Bob can 
use n qubits to transmit 2n bits of classical information, if they share n EPR pairs. Nevertheless, if it takes more 
than n bits of classical communication to distill the n EPR pairs, it would totally destroy the purpose of super- 
dense coding. Furthermore, in the study of LOCC protocols over quantum states, no quantum communication 



^The original name, "entanglement purification protocols" 
mechanics has other meanings. 



was considered less appropriate, since "purification" in quantum 
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takes place, and it is interesting to study the classical communication complexity of these (quantum) protocols. 
The history of classical communi cation c omplexity for quantum protocols can probably traced back to the seminal 
paper by Bennett and Wienser | BW92 |, which discussed teleportation and constructed a protocol that u ses 2n 
classical bits to transmit n qub its. H owever, this topic was largely overlooked until by Lo and Popescu |LP99| 



and Lo | L99 . Lo and Pop escu LP99 discussed the classical communication complexity of v arious prot ocols by 
Bennett et. al. [BBP+96a . They observed that the "entanglement concentration protocol" in |BBP+96a| doesn't 



require any classical communication, while the "entanglement dilution protoc ol" req uires 0{n) bits of classical 
communication for producing n copies of the "diluted" state. Lo and Popescu |LP99| constructed a new dilution 
protocol that only uses 0(\/n) bits of communication. This protocol was proven to be asymptotically optimal 
by Hayden and Winter [ HW02|] , and Har row and Lo [ HL02 , who proved matching lower bounds for general 
entanglement dilution protocols. Lo ||L99|| studied the communication complexity for Alice and Bob to jointly 
prepare many copies of arbitrary (known) pure states, and proved an non-trivial upper bound. All the results 
above focus on a relatively simple situation, where the input are n copies of a known pure state, and only the 
asymptotic results are known, i.e., the ratio of the amount of communication to n as n approaches infinity. 



1.3 Our Contribution 

In this paper, we study the classical communication complexity of EDPs with incomplete information. In this 
setting, Alice and Bob don't have the complete knowledge about the input state they share. Rather, the input 
state is a mixed state, or is adversarially prepared. This is a natural extension to the simple model, where 
Alice and Bob share a pure state. In fact, we argue that this is a more "realistic setting": it is very hard, if 
not impossible, to know precisely which pure state a quantum system is in. Some quite natural and commonly 
studied models of "noise" in quantum state are probabilistic in natural, and necessarily result in a mixed state. 
An example is the depolarization channel. Furthermore, EDPs that work with adversarially prepared states have 
the inherent worst-case behavior guarantee, and it more robust than EDPs designed only specifically for some 
known pure states. It is, therefore, very desirable to understand the communication complexity of EDPs that 
work in this setting. 

We also study the precise communication complexity of EDPs, rather than their asymptotic behavior. In 
fact, we try to answer questions of the following fashion: "On this particular input state class, how many bits 
of classical communication are needed in order to just output a single EPR pair with certain quality?" We feel 
that it is important to understand the communication complexity in this case, where the requirement seems to be 
minimal. Interestingly, as we shall see later, the answer to this minimal question already yields a lot of insights 
into the more general problem, where Alice and Bob wish to generate EPR pairs of not only high quality, but 
also of large quantity. 

To the best of our knowledge, this is the first paper that studies classical communication complexity of EDPs 
with incomplete information, and also the first paper to address the precise communication cost, rather than the 
asymptotic behavior. In fact, the only prior result that studied classical communication complexity of EDPs we 
are aware of is for th e spec ific "entanglement concentration protocol" by Bennett et. al. [BBP+96a . As pointed 
by Lo and Popescu | LP99 |, this protocol doesn't need any classical communication. Notice that this particular 
protocol is the first EDP that appears in literature, and works in perhaps the simplest possible setting, where the 
input is a large copy of identical pure states. For all the related work on classical communication complexity we 
are aware of, they all work with a relative simple model. In this model, Alice and Bob try to convert many copies 
of some pure state | 0) into many copies of some other state | ip) . The fact that only many copies of identical pure 
states are considered (and only asymptotic results are needed) makes a lot of techniques available, for example 
the Law of Large Nun ibcrs, the Centra l Limit Theo r em, and the conversion of multiple-round protocols into 
single- round protocols | BBP+96a , N99a , NCOO , LP99 , HL02 , HW02|| . These techniques no longer work when we 



move to mixed input states and ask for precise communication complexity. 

As another motivation, we point out that, as EDPs are closely related to Quantum Error Correcting Codes 
(QECCs), the communication complexity of EDPs is closely related to the efficiency of QECCs. Quantum 
error correcting codes are schemes to encode quantum states redundantly, such that if part of the states are 
corrupted, one can still recover the original encoded state. With QECC, Alice is able to transmit quanturn 



S96, G97, NCOC, P00| 



states reliably through a noisy quantum channel to Bob. The readers are referred to |p 
for more discussions on QECCs. One of the central issues concerning QECCs is to design QECCs that are 
effi cient (i.e., has low re dundancy) and robust (i.e., tolerate a wide range of noise). As pointed by Bennett et. 
al. BBP+96b , BDS-f96] , entanglement distillation protocols can also be use to transmit quantum states reliably 
through a noisy channel. Alice produces EPR pairs and sends Bob's share through the noisy channel. Then 
Alice and Bob engage in an EDP to "distill" near-perfect EPR pairs. Finally Alice and Bob use the shared near- 
perfect EPR pairs to perform teleportation and transmit the quantum states reliably. From this point of view, 
entang lement di stillation protocols can be thought as "interactive error correcting protocols". In fact, Bennett 
et. al. [ BDS+96 proved a relationship connecting QECCs and EDPs: they proved that QECCs and 1-way EDPs 
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(where only Alice sends information to Bob and Bob doesn't send anything back) are essentially equivalent. From 
any 1-way EDP, one can derive a QECC with the same parameter, and vise versa. They also showed that 2-way 
EDPs are more powerful than QECCs in that there exists a noisy channel for which no QECC is possible, but 
there exists 2-way EDPs that can transmit information through this channel. The communication complexity 
of EDPs somewhat corresponds to the redundancy of QECCs. As in the case of QECCs, it is therefore very 
desirable to construct EDPs of low communication complexity that tolerate a high level of noise. In this setting, 
the noise model is often adversarial or probabilistic, and both precise and asymptotic results on communication 
complexity are important. 

We study EDPs in 3 different settings, corresponding to 3 different models of "imperfect" EPR pairs. The 
first model is called the measure-r model. In this model, Alice and Bob originally share n perfect EPR pairs, 
and then r out of these n pairs are measured in the computational basis. Each measured pair ends in a mixed 
state i(|00)(00| + |11)(11|), and becomes disentangled. Alice and Bob have no information about which pairs are 
measured and which are not, but they know r. In fact, we assume that the r measured pairs are adversarially 
chosen. This model is similar to the model used in error correcting codes (both classical and quantum). The 
second model is called the depolarization model. In this model, n perfect EPR pairs were produced by Alice, and 
then she sends Bob's share of n qubits to Bob through depolarization channel of parameter p. In other words, 
each of Bob's qubits is left unchanged independently with probability \ —p and is replaced by a completely mixed 
stat e with probability p. It is a typical model for "noisy channels", and in particular was studied by Bennett et. 
al. [|BBP+96b| , |BDS+96| . The third model is called the fidelity model. Here, Alice and Bob only know that the 
fidelity of their shared state and perfect EPR pairs is 1 — e. Alice and Bob don't have any other information 
about the state. This is the model considered by Ambainis et. al. |ASY02 |, where they called it the "general 
error" model 

We obtain the following results: For the measure-r model, we obtain a tight upper bound on the fidelity of 
the output of protocols that don't use communication. More precisely, we prove that in the measure-r model, the 
maximal fidelity of a protocol is at most 1 — r/2n, if no communication is involved. Here we define the fidelity of 
a protocol to be the worse-cast fidelity of the output of this protocol and the perfect EPR pairs. This bound is 
tight in that we also present a (very simple) protocol that achieves a fidelity of 1 — r/2n. Interestingly, the proof 
seems quite non-trivial for this seemingly simple statement (and the trivial protocol that matches the bound) . For 
the depolarization model, we obtain an almost-tight, similar bound. We prove that in the depolarization model, 
the maximum fidelity of a protocol is 1 — p/2, if no communication is involved. This upper bound is almost tight, 
in that we also give a (very simple) protocol that achieves 1 — 3p/4. Both these 2 upper bounds are for protocols 
that are only required to output 1 qubit-pair, which seems to be the minimal requirement for a "useful" EDP. 
For the fidelity model, we give almost tight (up to an additive constant) bounds on communication complexity of 
EDPs. More precisely, we prove that the maximal conditional fidelity of an EDP of t bits of communication is at 
most 1 — e •p/2*+^, even if the EDP is only required to output 1 qubit pair. Here e is the fidelity of the input state, 
and p is the "ideal success probability" , which is the probability that the EDP succeeds with perfect EPR pairs 
(having fidelity 1) as input. Therefore, to achieve a fidelity or 1 — (5 on t he output, log(l/ 5) + log(e • p) — 1 bits of 
classical communication is needed. Comparing the result from [ASY02], which constructed a protocol that uses 
log(l/(5) -t- log(l — e) bits, our lower bound is tight up to an additive constant. Here we assume that both e and p 
are constant, which seems to be the reasonable assumption. One interesting observation is that our lower bound 
was proven for protocols what only output 1 qubit pair, while the matching upper bound is from a protocol that 
outputs many qubits (in fact, in the usual setting, the protocol outputs all but logarithmically number of input 
qubit pairs). This seems to indicate that the communication complexity is oblivious of the yield of the EDPs 
with respect the fidelity model. This fact is quite surprising, since it is definitely not the case for QECCs. 

All the proofs in out paper are from first principles and don't involve very complex analysis. Some techniques 
used in this paper would be interesting by themselves: in fact, as we pointed out earlier, the old techniques don't 
work any more in our setting, when mixed states and studied and we are interested in the precise communication 
complexity. Therefore, we need to use new techniques, among which are an alternative definition on fidelity, 
which proved very useful in proving the first 2 bounds, and an observation on the "splitting" of mixed states 
during communication, which is useful to prove the lower bound for the fidelity model. 



1.4 Outline of the Paper 

In Section |2[ we present some notations and definitions to be used in the rest of the paper. We prove a lower 
bound for the measure-r model in Section ||. We prove a lower bound for the fidelity model in Section ^. In 
Section ^ we prove the lower bound for the fidelity model. We conclude the paper in Section ^ Some proofs are 
postponed to the Appendix. 

^We feel that the name "general error" model isn't appropriate since this error model isn't the most general one. For example, it 
is not compatible with the previous error models. 
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2 Notations and Definitions 



All logarithms are base-2. We identify an integer with the 0-1 vector obtained from its binary representation. For 
a vector we write v[j] to denote its j-th entry. For 0-1 vector x, we denote its Hamming weight by \x\, which 
is the number of I's in x. We define B — {0, 1}, and naturally = {0, 1}". For binary strings x and y, we use 
x; y to denote the concatenation of these 2 strings. 

Throughout the paper we are interested in finite, bipartite, symmetric quantum systems shared between Alice 
and Bob. We identify a "ket" | (j>) with a unit column vector. We assume there exists a canonical computational 
basis for any finite Hilbert space of dimension N, and we denote it by {| 0), 1 1), | N — 1)}. We use superscripts 
to indicate which "side" a qubit or an operation belongs to. For example, a general bipartite state | (f) can written 

There are 4 Bell states for a pair of qubits shared between Alice and Bob, and we denote them as follows: 

(1) 
(2) 
(3) 
(4) 

We denote the state ($+)®", which represents n perfect EPR pairs, by We also abuse the notation to 
use to denote both the vector \l/„ and its density matrix n|5'„) (^I^nl, when there is no danger of confusion. 

A quantum state is disentangled if it is of the form | -0)'^ (g) | ip') ■ Any other pure state is entangled. A mixed 
state p is disentangled if and only if it is equivalent to a state that is a mixture of disentangled pure states. Any 
other mixed state is entangled. 

The Pauli Matrices X, Y, and Z are unitary operations over a single qubit defined as 

X{a\0) + /3\1)) = P\0)+a\l) 
Y{a\0)+(3\1)) = i(3\0)-ia\l) 
Zia\0) + (3\1)) = a|0)-/3|l) 

We use / to denote the identity operator. 

For a unitary operator U, we can write it in a matrix form under the computational basis. Then we define its 
conjugate, U* , to the entry- wise conjugate of U . Clearly U* is still a unitary operation. 

An error model is simply a set of bipartite (mixed) states, and is often denoted by M. We say a state p is 
consistent with M,il p ^ M. 



$+ 




\0)^\ 


0)^ 






1)^) 






\0)^\ 


0)^ 












\0)^\ 


1)^ 












\0)^\ 


1)^ 






0)^) 



2.1 Fidelity 

For two (mixed) states p and a in the same Hilbert space their fidelity is defined as 

F{p, a) = Tr2(Vpi/2CTpi/2). (5) 

Notice we are using a different definition as in [NCOO], where the square root of (||) is used. 
li a — \ip){ip\ is a pure state, the definition simplifies to 

F{p,\^){^\)^{^\p\^) (6) 

A special case for the fidelity is when | if) = for some n, such that p and have the same dimension. In 
this case, we call the fidelity of p and | (p) the fidelity of state p, and the definition simplifies to: 

F(p) = (*„ IpI vI/„) (7) 

We are often interested in the fidelity of 2 states of unequal dimensions. In particular, we are interested in 
the fidelity of a general bipartite state p, and the Bell state $+. This coincides with the definition of fidelity 
when p has dimension 2. When p has a higher dimension, we define its base fidelity to be the fidelity of the state 
obtained by tracing out all but the first qubit pair of p. We denote the base fidelity of p by F{p). 

It is easy to verify that the fidelity is linear with respect to ensembles, so long as one of the inputs is a pure 
state, as in the following claim. 
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Claim 1 If p is the density matrix for a mixed state that is an ensemble {pi, \ ((>{)}, and a is the density matrix 
of a pure state, then we have F(p,cr) = X^iPi ' P{\4'i){4'i\T'^)- • 



The fidelity is also monotone with respect to trace-preserving operations | NCOO| 



Claim 2 For any states p and a and any trace-preserving operator £ , we have F {£ {p) , £ [cr)) > F{p,(j). ■ 

One useful fact about fidelity is that any completely disentangled state has base fidelity at most 1/2. 
Lemma 1 If p is a completely disentangled state, then F[p) < 1/2. 

Proof: By the definition of base fidelity, we may assume that p has dimension 2. By Claim |l|, we only need to 
consider the case that p is a pure state \4)){(j)\. Since | 0) is disentangled, we may write it as 

|0) = (ao|O)+ai|l))®(/3o|O)+/3i|l)) 

Then a direction calculation reveals that 



2 



2 

< I {\ao\W + \c^i\W + l«o/3i*P + l«i/3o1') 
= li\ao\' + \a,\'){\p,\^ + \f3,\^) 

_ 1 
2 



2.2 Entanglement Distillation Protocols 

We give a detailed description on entanglement distillation protocols discussed in our paper. We often denote 
an entanglement distillation protocol by V. The protocol starts with a mixed state p shared between Alice and 
Bob. Alice and Bob can have their private ancillary qubits, originally initialized to |0). A protocol is either 
deterministic or probabilistic. For deterministic protocols, Alice and Bob don't share any initial random bits; for 
probabilistic protocols, Ahce and Bob share a (classical) random string. We say a protocol P is a t-h\t protocol, 
if there are t bits of (classical) communication during the protocol. We don't allow protocols to have any initial 
entanglement as auxiliary inputs, and neither do we allow quantum channels between Alice and Bob. 

An the end of a protocol, both parties output m qubits, which form the output of the protocol. In addition, 
Alice also outputs a special symbol (either a SUCC or a FAIL). The success probability of a protocol V over an 
input state p is the probability that Alice outputs SUCC at the end of the protocol, and we write this as '-'•[p]. 
The ideal success probability of a protocol V is its success probability over input \E'„. We say a protocol is ideal, 
if its ideal success probability is 1. If cr is the density matrix of the output of protocol V on input p, we write it 
as V{p) = a. If T is the density matrix of the output of protocol V on input p, conditioned on that Alice outputs 
SUCC, then we call r the conditional output of protocol V, and write this as T"^{p) = r. 

For an entanglement distillation protocol V , we define its fidelity with respect to an error model AA, denoted 
by Fj^{V), to be the minimal fidelity of its output over all input state consistent with JV[. In other words, 

FM{V)=u^inF{V{p)) (8) 

Similarly, we define the conditional fidelity to be the minimal fidelity of its conditional output, denoted by Fj^ (V): 

F%,{V) = n,mF{V^{p)) (9) 
peM 

When the error model is clear from the context, it is often omitted. 

3 The Measure-r Model 

We prove an upper bound on the fidelity of 0-bit EDPs with respect to the measure-r error model. 
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3.1 Notations and Definitions 

Wc start with more notations and definitions. 

A binary indicator vector, often denoted by v, is an n-dimensional vector, whose each entry is an element from 
{0, 1, *}. The degree of a binary indicator vector v is the number of entries that are not *, and we write this as 
deg(v). There are 2*" • (") binary indicator vectors of degree r. Each binary indicator vector v corresponds to a 

unique bipartite quantum state | (p-y) in in the following way: 

n-i ( \0)^\0)^ ifv[i] = 

I '/'v) = (g) I 0,), where | 0,) = <^ | 1)^ if v[j] = 1 
j=o [ if v\j]=* 

The state | i/iv) is called an error state, where v is called its error indicator vector. The degree of state | cpy) 
is the degree of its indicator vector. The error model for the measure-r model, denoted by M!^,., is defined to be 

M^,r = {\ (jw) I V is an n-dimensional binary indicator such that deg(v) = r} (10) 

An n-dimensional 0-1 vector x is consistent with a binary indicator vector v, if x[j] = v[j] for all j such that 
v[j] ^ *. We write this as x C v. For any v of degree r, there are 2""'' 0-1 vectors x consistent with v. It is not 
hard to verify that 

i^v) = ^(^Ei-)^i-)" (11) 

3.2 Two Useful Lemmas 

We prove 2 lemmas that would be useful for the proofs in this paper. Both lemmas are about how much "deviation" 
a quantum state undergoes when applied various unitary operations. 

First, we consider the "deviation" of an arbitrary pure state under the operations {/, X, Y, Z} over its first 
qubit. We have the following lemma: 

Lemma 2 Let \ <p) and \ tjj) be two pure states of the same dimension, not necessarily bipartite. Let I, X, Y, and 
Z be the unitary operations over the first qubit of\(j)). Then we have 

E K'A|t^l^)l'<2 (12) 
ue{i,x,Y,z} 

Proof: We write | 0) = ao| 0)| ./.q) + ai 1 1)| and | ^) = /3o| 0)| ^o) + A] 1)| Vi) 
Then we have 

{(j>\I\tp) = a(5/3o(0o I i/'o) + I V'l) 

{cp\X\i;) = alPo{<l>i\i;o) +a*f3i{i^o\cl)i) 
{^\Y\ij) = -iai/3o*(</>i|^o)+iao/?r(<^o|^/'i) 

Therefore 

E \{<P\U\i^)f = 2|ao/3o|'|(0o|V'o>|' + 2|ai/3i|2|(</)i|V'i)|' + 2|ao/?i|'|(</'o|^i)|' + 2|ai/?o|'|(<^^ 
ue{i,x,Y,z} 

< 2\ao\W + 2|ain/3iP + 2|ao|'|/?i|' + 2\aif\pof 
= 2(|aor + |ai|2)(|/3or + |/3i|2) 
= 2 



An immediate corollary is 
Corollary 1 Let | </>) be a pure sate. We have ^ue{i,x,Y,z} \ {^ 1^1 — 2- 



7 



Next, wc consider quantum states and operations over bipartite systems. In particular, we study the "devi- 
ation" of a general bipartite state under unitary operations of the form U ®U*. We interpret U ®U* &s Alice 
applies U to her first qubit and Bob applies U* to his first qubit. Again, we consider U e {/, X, Y, Z}. 

We have the following lemma. 

Lemma 3 Let \(f>) be a pure state in a bipartite system shared between Alice and Bob. Let L , X ® X* , Y ® Y* , 
and Z ® Z* be the unitary operations over the first All these 4 operations work on the first qubit of Alice and the 
first qubit of Bob. Then we have 

(0 1 0) + |(x ® x*)\ 0) + |(y ® 0) + (0 |(z ® z*)\ 4>) = 4F(| 4>)) (13) 

Proof: We first consider how the Bell states behave under these unitary operations. It is easy to verify the 
result, which we compile into the following table. 



Table 1: The Bell States under operators 



state 










i®r 


$+ 






*- 


X0X* 


$+ 








Y(»Y* 


$+ 








Z(g)Z* 











It is easy to see that the state is invariant under any of the 4 operations, while other Bell states will change 
their signs under some operations. 

Notice the 4 Bell states form an orthonormal basis for a bipartite system of 2 qubits. We decompose | (p) into 
the Bell basis and write 

I 0) = ao4'+ <»\ipo)+ «> I V'l) + "2*"^ O I V'2) + as^^ (g) I tpa) 

where J2^=o ~ Therefore we have 

(</.!(/.) = |aoP + |ai|2 + |a2|2 + |a3|' 
= |ao|'-|«i|' + |a2|'-|a3|' 
(<^|(y®r*)|(/.) = |aop-|ai|2-|a2p + |a3r 
(<^|(Z®Z* )!</.) = |ao|2 + |a,|2_|„2|2_|a3|2 

and so, 

(01 0) + (<^|(X ® X*)\<l>) + (<A|(y Y*)\ cl>) + {<f>\{Z ^ Z*)\<f>) = 4|aoP = 4F(| <f>)) 

■ 

The above lemma implies an alternative definition of the base fidelity of a pure state. 



3.3 A Tight Bound for the No-Communication Case 

We prove that the fidelity of 0-bit EDPs for the measure-r error model is at most 1 — r/2n, even if the protocols 
are only required to output one qubit-pair. Notice that fidehty is monotone. Therefore if no protocol can output 
a single qubit pair of fidelity at least 1 — r/2n, then no protocol can output multiple qubit pairs of fidelity at least 

1 - r/2n. 

Theorem 1 For any probabilistic 0-bit protocol V that outputs one qubit pair, we have F(V) < 1 — ^ with respect 
to the measure-r model. 

Notice that there exists a very simple probabilistic 0-bit protocol that has fidelity 1 — Alice and Bob use their 
shared random string to uniformly pick an EPR pair and output it. If this pair is measured, (which happens with 
probability r/n), the fidelity is 1/2, and otherwise it is 1. So the overall fidelity is exactly 1 — r/2n. So our upper 
bound is tight. 
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Proof: We consider a slightly different error model, where a random r out of n EPR pairs are measured. This 
corresponds to the density matrix 

yrJ V:dcgV=r 

Notice that this is the "average case" version of the measure-r model. Thus if we prove an upper bound on the 
fidelity of V over p, then it is also an upper bound with respect to the measure-r model. 

We shall prove that no deterministic 0-bit protocol can have a fidelity higher than 1 — r/2n if p is the input. 
Then, we conclude that no probabilistic protocol can have a fidelity higher than 1 — r/2n, too, since fidelity is 
linear. 

Notice V doesn't involve any communication, we can model it as Alice and Bob both applying a unitary 
operation to their share of qubits, outputs the first qubit and discard the rest. 

Suppose the unitary operators of Alice and Bob are Ua and Ub- We denote the states under these operations 

by 



Ua\x) 
Ub\x) 



Notice that we use " — >" instead of "=" since we allow Alice and Bob to use ancillary bits. Clearly, the vectors 
{| 4'x)}x are orthonormal, and so are the vectors {| ipx)}x- 
We shall prove that 



^ [[^((f/A®C/B)|0v)(0v|(C/A®C/s)t) 



dcg V— r 



< 1 



2n' 



which shall imply our lemma. By Lemma ^ (^) is equivalent to 



{(b^,mA^UB)\U^U*)iUA^UB)\(b^) 

ue{i,x,Y,z} 



^rJ dcgV=r 

We expand the left hand side: Notice that 

{Ua ® Ub)\ 0v> = rr;^ E I '^-) I 



(14) 



(15) 



and so we have 



/'v|(C/A®C/i3)^(C/®C/*)(C/A®C/B)kv) = ;5;^E E^'^- 1^1'^?^) -^^^ 1^*1^!^ 



for any unitary operation U. So, (|lj) is equivalent to 

2n C"-) E E E E ^ 

yrJ dcgV=r xHV ynv Ue{I,X,Y,Z} 

However, by Cauchy-Schwartz, we have 



2n 



dcg V=r xCV j/CV Ue{I,X,Y,Z} 



(16) 



M E EE E 

y^dcgV=r xnv ynv Ue{I :X ,Y,Z} 



'yi 



i dcgV=r xCVyCV U&{I,X,Y,Z} 



Next, we estimate the terms on the right hand side: 

1^ = EE E K^^it^i' 



E EE E K^^it^i' 

eg V=r xCV yCV U&{I,X,Y,Z} 



X y UG{I,X,Y,Z} 



E 
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Notice that since | are all orthonormal, we have ^ \{<j)x \U\ (jjy)]'^ < 1 for all x's. Thus 



EE E \{<^^\u\^.)\'<2-+' 

X V U£{I,X,Y,Z} 



For any x and y, we have 



E 



1 = 



n — \x ®y\ 

n — r — \x (By\ 



degV— r : a:IZVAyCV 

The reason is simple: the only freedom for v is where to put the {n — r) *'s. But for every position k such that 
x[k] ^ y[k], we have to have v[fc] — *. Then we still have {n — r — \x (B y\) *'s we can put anywhere. So if a; ^ y, 



E 



1 < 



dcgV=r : 2;CVAi/CV 



n-1 
n — r — 1 



Also notice that by Lemma ||, we have X];7g{/ x y z} K"?^^ 1^1 — 2 for any 
Putting things together, we have 



E EE E K^^if^i' 

og V=r xQ-V yCV Ui£{I,X,Y,Z} 



< 



X ue{i,x,Y,z} 
n\ fn — \ 
r ) V r — 1 



n — 1 



J-E E K'^^it^i' 



a:7^y ;76{/,X,Y,Z} 



E E \{'t>'Au\4>.)? + 

X ue{i,x,Y,z} 



71—1 

r 



; -EE E K^^i^i 

n\ /n ~ 1 
r I \r — 1 



Similarly, we have 



too. 



Thus we have 



which proves (|l^). 



E EE E i(^.it/i^.)P<2"-^r (1-^) 



E EE E (0.ic/i0.>-(V'.if/i^.)<2"+^(")(i-:^) 



;V=ra;[ZV y[ZV UG{I,X,Y,Z} 



r I 2n 



4 The Depolarization Model 

We prove an upper bound on the fidelity of 0-bit EDPs with respect to the depolarization model. 



4.1 Notations and Definitions 

We give notations and definitions used in this section. 

We firs t descr ibe the depolarization channel. A depolarization channel 2? of parameter p is a super-operator 
defined as |NCO0| 

In other words, this channel behaves in the following manner: with probability (1 — p), it keeps the state 
untouched, and with probability p, it replaces that with the completely mixed state. 



10 



It is not hard to verify that after passing the second qubit through this channel, the state becomes a mixed 
state 

= (1 - f )|1'+>(<I>+I + f + + l*->(*-|) 

The depolarization error model of n qubit pairs and parameter n, denoted as p, consists of a single state: 

Mi, = {pf-}. 

4.2 An Almost-Tight Bound for the No-Communication Case 

We prove that the maximal fidelity of 0-bit EDPs for the depolarization error model is 1 — p/2, even if the 
protocols are only required to output one qubit-pair. 

Theorem 2 For any probabilistic 0-bit protocol V that outputs one qubit pair, we have F(V) < 1 — f with respect 
to the depolarization model. 

There exists a very simple deterministic 0-bit protocol that has fidelity 1 — ^ : Alice and Bob simply output the 
first qubit pair. It is very easy to verify that the fidelity of this protocol is 1 — Therefore the bound in the 
theorem is almost-tight (by a constant factor). 

The proof to Theorem g is very similar to that to Theorem |l], except that it is more technical. We postpone 
the proof to Appendix 



5 The Fidelity Model 

We study the communication complexity of EDPs with respect to the fidelity error model. 

First, we give the definition of the fidelity error model. For a bipartite system of n qubit pairs, we define the 
fidelity error model of parameter e to be the set of all bipartite systems of fidelity at least 1 — e. We denote the 
error model by 

mI^, ^{p\p has dimension 2^" and F{p) > 1 - e} (17) 

Notice that this error model is very different from the two previous models we studied, since it provides much 
less information than the previous one. As a comparison, notice that in the measure-r model, all the error states 
have fidelity 1/2'", and in the depolarization model, the fidelity of the input is (1 — 3p/4)", both are very small. 
However, Alice and Bob have the additional information about the structure of the input states, and are able to 
use the information to do very well. 



5.1 Two Useful Facts About Positive Operators 

We present two useful facts about positive operators, used in the rest of the paper. 

For two positive operators A and B, we say A dominates B, if A — B is still a positive operator, and we write 
this a,s Ay B, or equivalently, B ^ A. 

Claim 3 For any positive super- operator £ and any positive operators A and B, if Ay B, then £{A) y £{B). ■ 

This directly follows the fact that £ is linear and preserves the positivity of operators: If A — B is a positive 
operator, then £{A) — £{B) — £{A — B) is also a positive operator. 

Claim 4 Let p and a be density matrices such that p y a ■ a, for some positive number a. For any POVM 
{E„i}, let pm — TT{pEm) and and q„i — Tr((T_E,„) be the probabilities the measurement result being m for p and 
(7, respectively. Then we have pm > a • 9m- ■ 

This is obvious, since we have Pm ~ a ■ — Tr((p — a ■ a)Em) > 0. 
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5.2 Upper and Lower Bounds for the Fidelity Model 

Ambainis, Smith, and Yang |ASY02 | proved that in the fidehty error model of parameter e (which they called 
the "general error model"), the maximal fidelity of a protocol is 1 — ^ -yt^x ^- if the protocol has n qubit 
pairs as input, fc perfect EPR pairs as auxiliary input, and outputs m qubit pairs. In a special case where fc = 
(no auxiliary input) and m — \ (only one pair is output), the maximal fidelity is 1 — ^^mf f ^ ^/^' f^ 
other words, no "interesting" entanglement distillation protocols exist for the fidelity error model. Their result 
is tight, in that they also constructed a protocol, namely the "Random Permutation Protocol", which achieves a 
fidelity of 1 - ^Il^^^e. One can slightly modify this protocol to completely eliminate communication, and 
still maintain a high fidelity. In the original construction of the random permutation protocol, communication 
is used in 2 places. First, Alice and Bob communicate to agree on a common random permutation. This part 
of communication is not needed for a probabilistic protocol. Second, Alice and Bob communicate to check if 
their measurements agree. We can modify the protocol by having Alice and Bob always "pretend" that they 
measurements agree. A careful analysis shows that this modification won't change the fidelity of the protocol by 
much. In fact, we have the following theorem: 

Theorem 3 There exists a probabilistic 0-bit entanglement distillation protocol of fidelity 1 — | < 1 — e/2 
with respect to the fidelity model of parameter e. ■ 

The situation for conditional fidelity is very different. In fact, Ambainis et. al. proved that good pr otocols 



exists with high conditional fidelity. In particular, the following result can be easily derived from | ASY02| 



Theorem 4 ([ ASY02|) For every n and s < n, there exists probabilistic s-bit entanglement distillation protocols 



of conditional fidelity 1 — 2 "/(l — e) with respect to the fidelity model of parameter e. 



Proof's sk etch: Consider the "Simple Random Hash" protocol in |ASY02|. The original construction for this 



protocol in |ASY02] has {2n + 2) bits of 2-way communication. But a close examination reveals that 1 bit of 
1-way communication suffices. In the original construction, Alice sends 2n bits to Bob to establish a common 
random string, which are not needed for a probabilistic protocol. In the original protocol. Bob also sends 1 bit 
of his measurement result back to Alice. This bit can also be eliminated in our model, since we allow one player 
(normally Alice) to output a special symbol at the end of the protocol. We then repeat the simplified 1-bit 
protocol for s rounds sequentially, and obtain an s-bit protocol of conditional fidelity 1 — 2~''/(l — e). ■ 

Furthermore, the "Simple Random Hash" protocol only consists of 1-way communication. Also notice that this 
protocol is ideal, in that if the input is the perfect EPR pairs then the protocol always succeeds. 

Therefore, to achieve a conditional fidelity of 1 — 5, only log j- — log(l — e) bits of communication is needed in 
the fidelity error model. Next, we shall prove a lower bound on the communication complexity. 

Theorem 5 For any probabilistic s-bit protocol of ideal success probability p, its conditional fidelity is at most 
1 — ep/2^^^ with respect to the fidelity model of parameter e. 

Immediately from the theorem, we obtain a log( j-) — log(i) — 1 lower bound on the communication complexity for 
ideal protocols of conditional fidelity 1 — S. In the usual setting where e is a constant, our lower bound matches 
the upper bound from Theorem ^, up to an additive constant. Interestingly, the theorem is proven for protocols 
that only output 1 qubit pair. However, this lower bound is good enough in that it matches the upper bound 
of the Simple Random Hash protocol, which in fact outputs many qubit pairs. In this sense, the communication 
complexity is "oblivious" of the yield of the EDPs. This is quite counter-intuitive. 

Proof: WLOG we assume the protocol only outputs one qubit pair. Consider a particular input state 



Po = (l-e')*n + e'-^ (18) 



It is a mixture of the perfect EPR pairs (with probability 1 — e') and the completely mixed state (with 

probability e'). Notice that F{^) — So if we set e' = ^|„_j^ e, then we have F(p) = 1 — e. We shall prove 

that no deterministic, s-bit protocol has fidelity more than 1 — 2^'^'^^^^ep over state po, which will imply that no 
probabilistic protocol can have fidelity more than 1 — 2^(''+^)ep, too. 

We fix a deterministic protocol V. WLOG, we assume it proceeds in rounds: in each round, one of the two 
parties (Alice or Bob) applies a super-operator £ to his or her share of qubits, and then sends one (classical) bit 
to the other party. The protocol consists of s rounds: one bit is sent in each round. Finally, Alice outputs the 
special symbol, determining if the protocol succeeds or fails. 
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To analyze the behavior of the protocol V over the input poi we consider how V behaves over state and 
state 5!^, respectively. We use p (resp. q) to denote the probabilities that V succeeds over state (resp. ^kt)- 
Notice p is in fact the ideal success probability of protocol V . Then it is easy to see that 

. (i-OP-f-(m;)) + .Vf-(n^)) 

(1 — e)-p + eq 

Notice that we always have F'^i^Pi^n)) < 1- Since ^jtt is a disentangled state, 7^(2!^) is also disentangled. 
Therefore we have F'^{'P{-^)) < 1/2 by Lemma |l} We shall prove that 

q > //2^ (20) 

which will imply that 

< vi;::^: ^ ' - ^.TT^T^ ^ > - 

Now we prove that q > p^/2''. We analyse 2 cases separately: in case I, the state vj/^ is the input to the 
protocol; in case II, the state is the input to the protocol. For each case, we keep track of the local density 

matrices of Alice and Bob. In case I, we use rj.'^ and rj^'^ to denote the local density matrices of Alice and Bob 

after the A;-th round; in case II, we use rj^'^ and rj^'^ , respectively. For fc = 0, we define the Tq'^, t^'^, t^^'^, 

and Tg to be the density matrices at the moment that protocol starts. 

We give more definitions: after the fc-th round, there are 2'^ possibilities depending on the first k bits commu- 
nicated. For any binary string t g B'', we use a}'^ (resp. <T('^) to denote the local density matrix of Alice (resp. 
Bob) after the fc-th round in case 1, conditioned on that the first k bits communicated so far are t[0], t[l], — 1]. 

We use p] to denote the probability that this happens (that the first k bits are i[0],t[l], ...,t[k — 1]). Obviously 
we have p\ = p\.Q + p\i for any t E . Furthermore, we have the following equalities 



Y.p\ = I (22) 



EI l,A l,A /r,o\ 

PfO-t = Tfe (23) 

EI l,B l,B /r,^\ 



We define cp''^, a^'^ ^ and for case II, similarly. 

We use ^ to denote the empty string. So we have = — 1. 

One important observation is that when the protocol starts, the local density matrices for Alice and Bob are 
identical in both cases: 

I, A l,B II, A II, B I f^r-\ 

When the protocol proceeds, the local density matrices in two cases will become different, since the state ^„ 
is an entangled state, while is not. However, they cannot differ "too far", as we shall prove in the following 
lemma: 

Lemma 4 For all k = 0,1, s — 1 and all t € B^ , we have • CTj'^ < cp'"^ O'l^d Pt ' ^t'^ ^ '^l^'^ ■ 

Proof: By induction. The base case is obvious. Now the inductive case. Consider the situation at the end of 
the fc-th round. Suppose the first k bits sent are /:[0],i[l], ...,t[k — 1]. WLOG we assume that in the (fc + l)-th 
round, Alice applies a super-operator £ to her share of qubits, and send one bit a to Bob. 

First we consider the density matrix for Alice. Notice that in general, a is the result of the measurement from 
£. Therefore, we can "split" £ into two positive super-operators £q and £1, such that 

£o{'^t ) = ■ CTt o (26) 

Pt 
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^0(CTt ) 



Pt;l \,A 
■ ^t-\ 

Pt 
JI 

II ■ '^t-O 
Pt 

JI 

Pt;l U,A 



Pi 



IT • 



(27) 
(28) 
(29) 



Intuitively, So corresponds to the case that a = is sent, and £i corresponds to the case that a = 1 is sent. 
By inductive hypothesis, we have 

I I, A , II, A 

Pt ■ o-* ^ o-* 



Combining (|2^) and (H) with Claim | yields that 



I l.A c / I l,A\ J c I n,Ax 

Pt-fi-(^t-fi =So{Pt-(^t ) ^ to(o-f ' ) 



Combining ^ and (H) with Claim | yields that 



JI 

Pt-fi II 4 



Pt 



, II,A 
II ■ '^t-fl — ^t;0 



(30) 



(31) 



I 1-4 c / I I,An , c / II, An 



JI 

^^t;l II,A 



Pt 



, II,A 



(32) 



Now we consider the local density matrix for Bob. In case I, the qubits between Alice and Bob are entangled. 
Therefore, the bit Alice sends to Bob carries some information about his state. In terms of the density matrix, 
Bob's local density matrix will "split" from crj'^ to al:^ and 0^!^ . Notice that Bob doesn't perform any operation 
to his qubits, and thus we have 



I I 

I,B Pt;0 I,B , Pta l,B 

<^t = ■ O't;0 r ' '^t:! 

Pt Pt 



(33) 



In case II, the qubits between Alice and Bob are disentangled. Therefore, the bit sent by Alice carries no 
information about Bob's own state. Thus Bob's local density matrix remains unchanged. Thus we have 



II,B II,B 



II.B 



By inductive hypothesis, we have 



I I,B , II, B 
Pt ■ ^t di cr* 



Combining (33), (p4), and (35), we have 



(34) 
(35) 



I I,B , I I,S , II. S 

Ptfl ■ C^t;0 ^ Pf^t ^ CTj 

I I,B , I I,B , II, B 

Pta ■ CTt i ^ Pt-cFt' <(rt 



II,B 
II,B 



(36) 
(37) 



So the inductive case is proved. 



Now we are ready to prove (|20|). After s bits are send, Alice will decide whether to succeed or fail. In case I, 
we use Tt to denote the probability that Alice choose to succeed conditioned on that the bits communicated are 

and thus by Lemma H, we know that in case II, the success 



f[0], i[l], t[s — 1]. Notice we have jj{ ■ a]'^ ^ crp'"^ 



probability is at least • rf . 
Therefore, we have 



Q > ^rt-pl-p] 



(38) 
(39) 
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which impUes that 



q > 



(40) 



> 



> 




(43) 



(41) 



(42) 



This proves the theorem. 



6 Conclusions and Future Work 

In this paper, we studied the classical communication complexity of entanglement distillation protocols in the 
setting of incomplete information, where the input states are mixed states or prepared adversarially. We study 
on the precise communication complexity of the protocols, as opposed to the asymptotic results. We also focus 
on the communication complexity of EDPs of the minimal requirement on yield, i.e., only 1 qubit pair is required 
as output. To the best of our knowledge, this is the first paper that studies classical communication complexity 
in the incomplete information setting, and also the first one to study the precise communication complexity. In 
our setting, many techniques don't work any more, e.g., the Law of Large Numbers, the Central Limit Theorem 
(both only works in the aggregated setting, where one has many copies of the identical object), and the conversion 
from multi-round protocols to a single-round protocol (it requires that the input state is pure, and Alice and Bob 
have the complete information about it). 

We considered 3 error models of the input state, and proved 3 corresponding results. The first 2 results are 
the "base cases" for the measure-r and the depolarization models. The result upper-bound the maximum possible 
fidelity of 0-bit EDPs (i.e., EDPs that don't employ any communication). Interestingly, In this case, the trivial 
protocols that outputs a random pair are already optimal (or near-optimal, in the depolarization model). Despite 
of their simple statement, these results seems non-trivial to prove. A technique in the proof is an alternative 
definition of the fidelity of pure states. The technique may have its independent interest. The third result 
is an almost tight lower bound on the communication complexity of EDPs with respect to the fidelity model. 
Interestingly, although the lower bound is proven for protocols of minimal yield, it matches the upper bound 
given by a specific protocol that has very high yield. In this sense, the communication complexity seems to be 
oblivious to the yield of EDPs. This observation is somewhat surprising, since this is not the case for QECCs. 

We view our paper as a first step toward the much greater project of understanding the communication 
complexity of EDPs in general. We feel that this paper opens much more open problems than the ones it solved. 
We list some of the open problems that we feel interesting: 

1. More Lower Bounds 

Our first 2 results on the measure-r models and the depolarization model arc indeed the "base-case" resiilt, 
in that they only solved the problem where there is no communication at all. What happens when there is 
communication? In particular, in the measure-r model, if r = 1 and n > 3, then there exists deterministic 
EDPs of fidelity 1. This contrasts with the results that the maximum fidelity of 0-bit probabilistic EDPs is 
1 - l/2n. What about 1-bit EDPs? 

2. Tighter Lower Bounds 

Our result on EDPs with respect to the depolarization channel is not tight: we managed to prove an 1 —p/2 
upper bound on the fidelity of 0-bit EDPs, but the lower bound given by the trivial protocol is 1 — 3p/4. 
We conjecture that 1 — 3p/4 is the right upper bound but was unable to prove it. 

3. EDPs with Initial Entanglement 

Our paper didn't consider EDPs where Alice and Bob share some initial entanglement (possible in the form 
of EPR pairs). How would the initial entanglement affect the communication complexity? 
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4. Deterministic vs. Probabilistic EDPs 



All the results in our paper are proven against probabilistic EDPs, where Alice and Bob share a classical 
random tape. Can one prove stronger results against deterministic EDPs? Is there a trade-off between the 
amount of shared randomness used and the amount of classical communication? 
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A Proofs to the Results for the Depolarization Model 

Proof: [to Theorem § 

Notice that by changing the basis, we can write the density matrix, pp, in another form: 

P, = {l-P)- |cf>+>($+| + I . (|00)(00| + |01)(01| + |10)(10| + |11)(11|) 

which gives another interpretation of the depolarization modeh each EPR pair, is kept intact with probabihty 
(1 — p), and is replaced by a completely mixed state with probability p. 

This observation leads us to consider a related error model, namely the "random-corrupt" model. In a random- 
corrupt model of parameter r, r EPR pairs are randomly chosen from the n pairs and are "corrupted" — meaning 
being replaced by the completely mixed state i(|00)(00| -I- |01)(01| -I- |10)(10| -I- 

It is easy to see that a depolarization error model of parameter p is simply a mixture of the random-corrupt 
models, with probability (")p''(l — p)"^'' being of parameter r. 

We shall prove that the maximal fidelity of any 0-bit protocol over the random-corrupt model of parameter r 
is 1 — r/2n. This will imply our theorem, since we have 



r-=n ^ 



-) 



1 - ^ 



As in the proof to Theorem |l], we only consider deterministic protocols. 

We present more notations and definitions. As extended indicator vector, often denoted by u, is an n- 
dimensional vector, whose each entry is an element from {00, 01, 10, 11, Its degree is the number of entries that 
are not *. There are 4''(") extended indicator vectors of degree r. Each extended indicator vector u corresponds 
to a unique bipartite state | V-'u) in the following way: 



|0)^ 


0)^ 


if v[j] 


= 00 


|0)^ 


1)^ 


if v[j] 


= 11 


ll)^ 


0)^ 


if v[j] 


= 10 


ll)^ 


1)^ 


if v[j] 


= 11 






\fv[j] 


— * 



n-1 

I V'u) = I where 

3=0 



We call such an | V'u) an extended error state. 

An 2n-dimensional 0-1 vector x is consistent with an extended indicator vector u, if x[j];a;[n + j] = u[j] for 
all j such that v[j] ^ and x[j] = x[n + j] for all j such that v[j] = *. We write this as a; C u. There are 2""'' 
0-1 vectors x consistent with an indicator vector of degree r. We view x as the concatenation of 2 n-dimension 
vectors: x — l]r, and we write them as I = LT(a;) and r — RT(a;). 

With the notations, we can write the extended error states as 



1 



2(n-r)/2 ^ 



ij{x))^\Rj{x)y 



(44) 



We define the discrepancy of x to be DIS(a;) — LT(s) RT(s), where © stands for bit-wise XOR. The degree 
of discrepancy of x is |DIS(a;)|, the Hamming weight of DIS(x). Clearly, there are (^')2" 0-1 vectors of dimension 
2n having degree of discrepancy d. Furthermore, if x has degree of discrepancy d, then the number of degree-r 
extended indicator vectors u such that a; C u is ("Z^)- This is because for every j such that x[j] ^ x[n + j], we 
must have u[j] — x[j]; x[n + j] in order to have x Qu. So the only freedom for u is to put (n — r) *'s in the n — d 
places where x[j] = x[n + j]. 

Now we consider an arbitrary 0-bit protocol. We model it as Alice and Bob both applying a unitary operation 
to their share of qubits, outputs the first qubit and discard the rest. Suppose the unitary operators of Alice and 
Bob are Ua and Ub- We denote the states under these operations by 



Ua\x) 
Ub\x) 

Then as in the proof to Theorem |l|, we shall prove that 



dcg U— r 



J2 {^u\iUA®UB)HU®U*){UA®UB)\M 
U£{I,X,Y,Z} 



(45) 
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which will imply our theorem. 
Notice that 

{Ua ® Ub)\ V'u) = ^7;^ E I ^LT(.))I V'RT(x)) 



2(n-r)/2 



and so we have 

(Vu|(t/A®C/s)t(C/0C/*)(f/A®C/s)|Vu) = E('^LT(.)|C^I'^LTfe))-(^RT(.)l^lV'RT(,)) 

xCUyCU 

So we only need to prove that 

E EE E (^LT(.)|f^lW-<^RT(x)|f^1V'RT(,))<4(l-^) (46) 

Vr/ dcgU=r2;CUj/!ZUC/e{/,X,y,Z} 

By Cauchy-Schwartz, we have 

E EE E (<^LT(.)|f^l'^LT(,))-(V'RT(x)|t^1V'RT(,)) 

degU=r xnUynuUe{I,X,Y,Z} 

W E EE E l(^LT(.)|t^l^LT(.))'^ 

\deg U=r xCU j/CU Ue{I,X,Y,Z} 

E EE E l(^RT(.)|t^*I^RT(.))l^ 

^^degU=rxCUi/CU!7£{/,X,y,Z} 

Now we estimate 

E EE E K<^LT(.)it^i<^LT(.)>r 

deg U=r xCU yCU Ue{I,X,Y,Z} 

Notice we can write a; as a; = LT(x); (LT(x) ® DIS(a;)) and y as y = LT(t/); (LT(t/) © DIS(7/)). If there exists an 
extended indicator vector u such that x C u and j/ C u, wc must have DIS(x) = DIS(2/). This is because that for 
every j such that DIS(a;)[j'] = 1, x[j] and x[n + j] differ. Thus we must have v[j] = a;[j];a;[n + j], which implies 
that v[j] = y[j];y[n + j], and DIS(y)[7'] = 1. In fact, for every j such that DIS(a;)[j] = 1, we have x[j] = y[j] and 
x[n + j] = y[n + j]. 

So we have 

E EE E l(</>LT(.)|C^I^LT(,))P 

dcg U=r xHU t/CU Ue{I,X,Y,Z} 

= E E E \i<f>^\u\h)\'j: E 1 

aSB" faeS" C/e{/,X,y,Z} cGB" degU=r: [(a;(aec))CU]A[(b;(bec))CU] 

by a substituting a for LT(x), 6 for LT(y), and c for DIS(a;). 
Now we fix a and b, and compute 

E E 

ceB" degU=r: [(a;(a©c))CU]A[(6;(6®c))CU] 

We define k = \a(B b\. For every j where a[j] 7^ we must have c[j] = and u[j] = *. For every j where 
a[j] = i>[j], if we have c[j/] = 1, then we must u[j] = a[j]] (a[j] ® 1); if we have c[j] = 0, then u can be cither 
a[j']; a[j'] or *. Therefore, of n — positions where a[j] = 6[j], r would be chosen where vcu has a non-* entry. Of 
these r places, one has the freedom to choose c\j] = or c\j] = 1. For all other places, c\j] = and u = *. So we 
have 

ceB" dcgU=r: [(a;(aec))CU]A[(&;(&ec))CU] 
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In other words, 



E EE E i('^lt(.)|c^i<^lWI'- E E E \{4>a\u\^,)\'-r 

Since | (j)a)'s are orthogonal, we have 



n~\a® b\ 
r 



Also by Lemma 0, we have 



EE E l(0a|t/|0.)P<2«^ 

a b UG{I,X,Y,Z} 



(47) 



Therefore 



EE E \{^a\U\^,)\'-2 
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n- |a® &| 
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n — 1 
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EE E K'^J^i'^^)!' 



which implies which implies the theorem. 
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